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The surface transmembrane glycoprotein is responsi- 
ble for mediating virion attachment to cell and subse- 
quent virus-cell membrane fusion. However, the molec- 
ular mechanisms for the viral entry of coronaviruses 
remain poorly understood. The crystal structure of the 
fusion core of mouse hepatitis virus S protein, which rep- 
resents the first fusion core structure of any coronavirus, 
reveals a central hydrophobic coiled coil trimer sur- 
rounded by three helices in an oblique, antiparallel man- 
ner. This structure shares significant similarity with both 
the low pH-induced conformation of influenza hemagglu- 
tinin and fusion core of HIV gp41, indicating that the 
structure represents a fusion-active state formed after 
several conformational changes. Our results also indicate 
that the mechanisms for the viral fusion of coronaviruses 
are similar to those of influenza virus and HIV. The coiled 
coil structure has unique features, which are different 
from other viral fusion cores. Highly conserved heptad 
repeat 1 (HR1) and HR2 regions in coronavirus spike pro- 
teins indicate a similar three-dimensional structure 
among these fusion cores and common mechanisms for 
the viral fusion. We have proposed the binding regions of 
HR1 and HR2 of other coronaviruses and a structure 
model of their fusion core based on our mouse hepatitis 
virus fusion core structure and sequence alignment. Drug 
discovery strategies aimed at inhibiting viral entry by 
blocking hairpin formation may be applied to the inhibi- 
tion of a number of emerging infectious diseases, includ- 
ing severe acute respiratory syndrome. 


Coronaviruses are enveloped viruses with single-stranded, 
positive-sense genomic RNA that is 26—31 kb in length (1, 2). 
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The Coronaviridae exhibit a broad host range, infecting many 
mammalian and avian species and causing upper respiratory, 
hepatic, gastrointestinal, and central nervous system diseases. 
Coronaviruses in humans and fowl primarily cause upper res- 
piratory tract infections, whereas porcine and bovine coronavi- 
ruses establish enteric infections that result in severe economic 
loss (3). The coronaviruses also include mouse hepatitis virus 
(MHV),' infectious bronchitis virus, feline infectious peritonitis 
virus, and the newly emergent human severe acute respiratory 
syndrome-associated coronavirus (HcoV-SARS) (4). 

The surface glycoproteins of enveloped viruses play essential 
roles in viral entry into cells by mediating virion attachment to 
cells and the virus-cell membrane fusion, the initial events of 
the viral infections. The spike (S) protein is the sole viral 
enveloped glycoprotein responsible for cell entry in coronavi- 
ruses. It binds to the cell surface receptor and mediates subse- 
quent fusion of the viral and cellular membranes (5). Under the 
electron microscope, the spike proteins can be clearly seen as 
20-nm-long surface projections on the virion membrane (6). 

The spike proteins of coronaviruses share several features 
with other viral glycoproteins mediating viral entry, including 
the hemagglutinin (HA) protein of influenza virus, gp160 of 
human immunodeficiency virus (HIV) and simian immunode- 
ficiency virus (SIV), GP of Ebola virus, and fusion protein of 
paramyxovirus (7, 8). These glycoproteins are all synthesized 
as single polypeptide precursors that oligomerize in the endo- 
plasmic reticulum to form trimers. Most of the enveloped pro- 
teins with fusion activity contain two noncovalently associated 
subunits: S1 + S2 in coronaviruses, HA1 + HA2 in influenza 
viruses, gp120 + gp41 in HIV/SIV, GP1 + GP2 in Ebola virus, 
and F1 + F2 in paramyxoviruses, all of which are generated by 
proteolytic cleavage. Nevertheless, some enveloped proteins 
have no cleavage in their precursors and yet still maintain 
fusion activity, such as the S proteins of some coronaviruses (9) 
and GP of Ebola virus (10). 

A hydrophobic region in the membrane-anchored subunit of 
enveloped proteins, termed the fusion peptide in class I fusion 
proteins, has been shown to insert into the cellular membranes 
during the fusion process (11, 12). The regions following the 
fusion peptide have a 4-3 heptad repeat (HR) of hydrophobic 
residues, a sequence feature characteristic of coiled coils. The 
first heptad repeat, termed HR1 (HRA or N peptide), is fol- 
lowed by a short spacer domain and a second heptad repeat, 
termed HR2 (HRB or C-peptide), followed by another short 


1 The abbreviations used are: MHV, mouse hepatitis virus; HA, he- 
magglutinin; HIV, human immunodeficiency virus; HR, heptad repeat; 
MES, 4-morpholineethanesulfonic acid; MMLV, Moloney murine leuke- 
mia virus; S protein, spike protein; SARS, severe acute respiratory 
syndrome; SIV, simian immunodeficiency virus; TM, transmembrane. 
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spacer and the transmembrane (TM) anchor (13). Biochemical 
and structural analyses of HA, (14), HIV-1/SIV gp41 (15-21), 
Ebola virus GP2 (22, 23), and SV5F1 (24) indicate that these 
heptad repeat regions form six-helix bundles. The N-terminal 
heptad repeat forms a central coiled coil, which is surrounded 
by three HR2 helices in an oblique, antiparallel manner. 

Among all enveloped glycoproteins, the membrane fusion 
mechanism of the HA of influenza viruses has been studied in 
greater detail (7, 8, 25). HA is proteolytically cleaved to gener- 
ate a receptor binding subunit (HA,) and an anchored subunit 
(HA,) containing the fusion peptide. Numerous evidences sug- 
gest that the HA of influenza undergoes a conformational 
change, from a native (nonfusogenic) to fusion-active (fuso- 
genic) state during the viral fusion process. In the native HA, 
part of the heptad repeat region of HA, forms a nonhelical loop 
(26) but converts into a coiled coil when exposed to low pH (14). 
The later conformation is generally regarded to be a fusogenic 
state because the low pH also activates influenza membrane 
fusion. This conformation change is also the basis of the 
“spring-loaded mechanism” for activation of viral fusion (27). 

The spike protein of MHV A59 has been identified as a class 
I fusion protein and shares common features with other viral 
fusion proteins (28). However, there are several significant 
differences between the membrane-anchored subunits of coro- 
navirus spike proteins and HA,, gp41, GP2, and SV5F1. First, 
the ectodomain is much larger (550 residues versus 120-380 
residues) in size in S protein. Second, the HR1 region is pre- 
dicted to be much larger (more than 100 residues versus 30—60 
residues) in size by the learn-coil VMF program (29) and also 
verified by proteinase K-resistant experiments (28). Third, the 
putative fusion peptides of all other viral fusion proteins are 
located at the N terminus of the membrane-proximal subunits, 
whereas the S protein features an internal fusion peptide. 

Highly conserved HR1 and HR2 regions in coronavirus spike 
proteins suggest that they share similar three-dimensional 
fusion core structures and a common mechanism for viral fu- 
sion. The binding region of the HR1-HR2 complex of other 
coronaviruses and likely structures of their fusion cores can be 
proposed based on MHV fusion core structure and sequence 
alignment. Analogous to HIV C-peptides, HR2 peptides of coro- 
naviruses are likely to act in a dominant-negative manner to 
inhibit hairpin formation, thereby inhibiting viral entry. Thus, 
drug discovery strategies aimed at inhibiting viral entry by 
blocking hairpin formation may also be applied to the inhibi- 
tion of emerging infectious diseases such as SARS. 

Although previous biochemical and electron microscopic 
analyses have shown that the HR1-HR2 complex in the S 
protein of MHV forms a fusion core (28), the exact binding 
region of the HR1-HR2 complex and the detailed structure of 
the fusion core remain unknown. Here we report the determi- 
nation of the crystal structure of the fusion core of MHV A59 
spike protein to 2.5 A resolution by x-ray crystallography and 
discuss the implications of the structure for coronavirus mem- 
brane fusion. 


EXPERIMENTAL PROCEDURES 


Expression, Purification, and Crystallization—The expression, puri- 
fication, and preliminary crystallographic studies of the MHV 2-Helix 
protein have been described elsewhere.” The PCR-directed gene was 
inserted into pET22b (Novagen) vector, and the selenomethionine MHV 
2-Helix derivative was expressed in M9 medium containing 60 mg 
liter”? selenomethionine in Escherichia coli strain BL21 (DE3). The 
product was purified by nickel-nitrilotriacetic acid affinity chromatog- 
raphy followed by gel filtration chromatography. The purified MHV 
2-Helix derivative was dialyzed against 10 mm Tris, pH 8.0, 10 mm NaCl 


2Y. Xu, Z. Bai, L. Qin, X. Li, G. F. Gao, and Z. Rao, submitted for 
publication. 
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and concentrated to 8 mg ml~’. Crystals with good diffracting quality 
could be obtained in 0.1 M MES, pH 6.5, 10% PEG 4000 (v/v), 8% 
dimethyl sulfoxide (v/v), 5 mm hexaminecobalt trichloride after 3 days. 
The expression, purification, and crystallization of nMHV is the same 
as for MHV 2-Helix. Crystals with good diffracting quality could be 
obtained in 0.1 M MES, pH 6.5, 138% PEG 4000 (v/v), 5% dimethyl 
sulfoxide (v/v). 

Data Collection and Processing—The MHV 2-Helix crystal was 
mounted on nylon loops and flash-frozen in an Oxford Cryosystems cold 
nitrogen gas stream at 100 K. Multiple wavelength anomalous disper- 
sion data were collected by a rotation method using a MarCCD detector 
with synchrotron radiation on beamline 3W1A of the Beijing Synchro- 
tron Radiation Facility. Data were collected from a single selenomethio- 
nyl derivative crystal at peak (0.9799 A), inflection (0.9801 A), and 
remote (0.9000 A) wavelengths to 2.5 A. Data collection from the nMHV 
2-Helix crystal was performed in-house on a Rigaku RU2000 rotating 
anode x-ray generator operated at 48 kV and 98 mA (Cu Ka; A = 1.5418 
A) with a MAR 345 image plate detector. The crystal was mounted on 
nylon loops and flash-frozen at 100 K using an Oxford Cryosystems cold 
nitrogen gas stream. Data were indexed and scaled using DENZO and 
SCALEPACK programs (30). Data collection statistics are shown in 
Table I. 

Phase Determination and Model Refinement—For MHV 2-Helix 
structural determination, initial multiple wavelength anomalous dis- 
persion phasing steps were performed using SOLVE (31) and followed 
by density modification by RESOLVE (32). The program O (33) was 
used for viewing electron density maps and manual building. The initial 
structure was subsequently refined to a final R-value of 22.4% and free 
R-value of 29.1%. The quality of the structure was verified by PRO- 
CHECK (34). None of the main chain torsion angles is located in 
disallowed regions of the Ramachandran plot. The statistics for the 
structure determination and refinement are summarized in Table I. 
The figures were generated with the programs GRASP (35), SPDBView 
(36), and MOLSCRIPT (37). 


RESULTS AND DISCUSSION 


Structure Determination—Two peptides (HR1 and HR2, Fig. 
1, A and B) encompassing the N-terminal and C-terminal hep- 
tad repeats of the MHV spike protein assemble into a stable 
trimer of heterodimers (28). The HR1 and HR2 regions of the 
MHV S protein consist of residues 968-1027 and residues 
1216-1254, respectively (Fig. 1, A and B). The fusion core of the 
MHV spike protein was prepared as a single chain by linking 
the HR1 and HR2 domains via an eight-amino acid linker 
(GGSGGSGG, single amino acid abbreviation used here). The 
constructs and the encoded proteins were also called MHV 
2-Helix (Fig. 1A). The preparation and characterization of the 
2-Helix proteins will be reported elsewhere.” The MHV 2-Helix 
forms crystals that have unit cell parameters a = b = 48.3 A, 
c = 199.6A , a = B = 90°, y = 120° and belong to the space 
group R3. The crystals contain two MHV 2-Helix molecules/ 
asymmetric unit, and the diffraction pattern extends to 2.5 
A resolution.” 

The crystal structure of the MHV 2-Helix was solved by 
multiple wavelength anomalous dispersion from a single sel- 
enomethionyl derivative crystal. Four selenium sites could be 
located in one asymmetric unit from Patterson maps calculated 
using the program CNS 1.0 (38). The experimental electron 
density map was easily interpretable in the helical regions. The 
model was improved further by cycles of manual building and 
refinement using the programs O (33) and CNS (88). The struc- 
ture was subsequently refined to a final resolution of 2.5 A with 
an R-value of 22.4% and free R-value of 29.1%. The final model 
statistics are summarized in Table I. 

Description of the Structure—In the three-dimensional struc- 
ture of the MHV 2-Helix, the fusion core has a rod-shaped 
structure ~80 A in length and a maximum diameter of 28 A. 
The complex is a six-helix bundle comprising a trimer of MHV 
2-Helix. The center of this bundle consists of a parallel trimeric 
coiled coil of three HR1 helices that were packed by three 
antiparallel HR2 helices (Fig. 2, A and B). The N terminus of 
HR1 and the C terminus of HR2 are located at the same end of 
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TABLE I 
Data collection and model refinement statistics 
Data set statistics 
MB 2 Mele nMHV 2-Helix 
Peak Edge Remote native 
Space group , R3 R3 R3 R3 
Unit cell parameters (A) 48.4 48.4 200.0 48.47 48.47 200.1 48.3 48.3 199.4 51.6 51.6 198.2 
Wavelength (A) _ 0.9799 0.9801 0.9000 1.5418 
Resolution limit (A) 2.4 2.4 2.4 2.06 
Observed reflections 49,717 50,167 49,352 83,487 
Unique reflections 6,842 6,867 6,804 12,068 
Completeness (%) 100(100)* 100(100) 100(100) 98.6(100) 
(I/o(D)) 14.3(4.1) 14.6(4.5) 13.7(3.8) 28.7(4.3) 
Reseces” (%) 9.4(34.0) 8.4(31.0) 9.3(36.1) 9.4(29.9) 
Final refinement statistics 
MHV 2-Helix nMHV 2-Helix 

Ryork’ (%) 22.4 26.2 

fret (%) . 29.1 29.8 
Resolution range (A) 35-2.5 50-2.06 
Total reflections used 5,833 11,689 
No. of reflections in working set 5,501 11,092 
No. of reflections in test set 332 597 
r.m.s.d.° bonds(A) 0.012 0.012 
r.m.s.d. angles(°) 1.8 1.4 


“ Numbers in parentheses correspond to the highest resolution shell. 


” Rinerge = Yn 2din — Un) [RI UI,), where (I,) is the mean of the observations J;,, of reflection h. 


«work = X(Fous 
Reree ot \ i 
°y.m.s.d., root mean square deviation from ideal geometry. 


~~ Feaicll)/ Foys- 


the six-helix bundles, placing the fusion peptide and TM do- 
mains close together. A region of about 190 amino acids would 
be located at the other end of the six-helix bundle between HR1 
and HR2. The linker and several terminal residues were dis- 
ordered in both molecules. In one asymmetric unit, one mole- 
cule includes residues 970-1023 in HR1 and 1216-1252 in 
HR2, and the other molecule includes residues 969-1022 in 
HR1 and 1216-1254 in HR2. The two trimers are created by 
the same 3-fold axis of the crystallographic unit cells and are 
both parallel with the 3-fold axis of the crystallographic unit 
cells, and there is about 30 degrees rotation in the orientation 
parallel with the 3-fold axis between the two trimers. The root 
mean square deviation of the two molecules in one asymmetric 
unit is 0.36 A. There is only one weak hydrogen bond between 
the two parallel trimers, from OH (Tyr???) to O (Glu??**) with 
a distance of 3.34 A. 

Residues 969-1022 of HR1 fold into a 15-turn a-helix 
stretching the entire length of the coiled coil. As in other 
naturally occurring coiled coils of the fusion core, the residues 
in the a and d positions of HR1 are predominantly hydrophobic 
(Fig. 1B). A sequence alignment of MHV with other represent- 
ative coronavirus spike proteins shows that the residues at 
these two heptad repeat positions are highly conserved 
(Fig. 1B). 

Residues 1232-1247 of HR2 form a 5-turn amphipathic a-he- 
lix, whereas residues 1216-1231 and 1248-1254 form two 
extended chains at the N and C terminus of HR2, respectively. 
Each HR2 fits into the long grooves formed by the interface of 
the three HR1 helices, and no interaction is observed between 
individual HR2 helices (Fig. 2, A and B). The C terminus of 
HR2 ends with Glu!”**, which is aligned with Gln®”° of HR1; 
Gln®”° is also the N terminus of the HR1 domain. The N 
terminus of HR2 starts with Asp’*!°, which is aligned with 
Tle?°?3 of HR1 (Fig. 2C). 

Linker between HRI and HR2 and nMHV 2-Helix Struc- 
ture—To verify whether the linker (GGSGGSGG) between HR1 
and HR2 affects the natural structure of the MHV 2-Helix, we 
made a new construct (termed nMHV 2-Helix) that includes a 
shorter HR1 and longer linker. In the nMHV 2-Helix construct, 


is the R-factor for a subset (5%) of reflections that was selected prior refinement calculations and not included in the refinement. 


the HR1 consists of residues 968-1017, the HR2 consists of 
residues 1216-1254, and the new fusion core was prepared as 
a single chain by linking the HR1 and HR2 domains via a 
22-amino acid linker (LVPRGSGGSGGSGGLEVLFQGP), 
which is flexible and long enough to allow the HR1 and HR2 to 
form a natural interaction (Fig. 1A). The nMHV 2-Helix forms 
crystals in space group R3 with lattice dimensions of a = 6 = 
51.6A,c =198.2A,a= B = 90°, y = 120°. The crystals contain 
two 2-Helix molecules/asymmetric unit and diffract x-rays to a 
resolution of at least 2.1 A. 

The crystal structure of nMHV 2-Helix was determined by 
molecular replacement with the MHV 2-Helix structure as a 
search model. Rotation and translation function searches were 
performed in CNS (38). The model was improved further by 
cycles of manual building and refinement using the programs O 
(33) and CNS (88). The final R-value and free R-value for the 
refinement are 26.2 and 29.8%, respectively. The final model 
statistics are listed in Table I. 

The nMHV 2-Helix structure is largely similar to the MHV 
2-Helix structure, with the exception of several residues at the 
N terminus of HR2 (Fig. 2C). The overall root mean square 
deviation between the two structures is 0.48 A, which is calcu- 
lated using the CCP4 program LSQKAB. The nMHV 2-Helix 
structure also contains two molecules/asymmetric unit. One 
molecule includes residues 968-1017 in HR1 and 1224-1254 in 
HR2, whereas the other molecule includes residues 970-1017 
in HR1 and 1229-1254 in HR2. The linker is also disordered 
and cannot be traced in the structure. The N terminus of HR2 
cannot be seen in the structure because the C terminus of HR1, 
which is important for binding the N terminus of HR2, has 
been discarded in the new construct. The linker is long and 
flexible enough, so we can conclude that both structures surely 
represent the natural structure of the complex of HR1 and HR2 
because the choice of linker does not affect the real interaction 
between the two heptad repeat regions. We will focus our 
following structural analysis on MHV 2-Helix structure. 

Interactions between HR1 and HR2—Three HR2 helices pack 
obliquely against the outside of the HR1 coiled coil trimer in an 
antiparallel orientation. The HR2 helices interact with HR1 


Crystal Structure of MHV Spike Protein Fusion Core 


SS FP HRI HR2 TM 
$1°S2 


A 


30517 


HR Linker HR? 
| mm 


MHV 2-Helix 
MENQKM1 ASAFNNALGAIQDGFDAT NSALGKIQSVVNANAEALNNLLNQLSNRFGAISASL 5 
C___ cescesee DLSLDFEKLNVTLLDLTYEMNRIQDATKKLNESY INLKE 
nMHV 2-Helix 


MENQKMTASAFNNALGAIQDGFDATNSALGKIQSVVNANAEALNNLLNQLS —____4 


Fic. 1. Structure determination of 
the MHV spike protein fusion core 
trimer. A, schematic representation of B 


coronavirus MHV A59 spike protein and HR1 

the MHV 2-Helix and nMHV 2-Helix con- MEV. ASS 

structs. S1 and S2 are formed after pro- ae ee 
ov- 


teolytic cleavage (vertical arrow) and non- 
covalently linked. The enveloped protein 
has an N-terminal signal sequence (SS) 
and a TM domain adjacent to the C ter- 
minus. S2 contains two HR regions 


IBV 
HeoV-229E 
FIPV 


(hatched bars), termed HR1 and HR2 as ona 
indicated. FP (hatched bars) is a putative ade 
fusion peptide followed by HR1 region. HeoV-229E 
For the MHV 2-Helix, two HR regions = 
were linked to a single polypeptide with 

an 8-residue linker (GGSGGSGG). For HR2 
the nMHV 2-Helix, HR2 and a shortened MHV_A59 
HR1 were linked with a 22-amino acid tones 


linker (LVPRGSGGSGGSGGLEVLF QGP). 
B, sequence alignment of coronavirus spike 
protein HR1 and HR2 regions. Letters above 
the sequence indicate the predicted hydro- 
phobic HR a and d residues, which are C 
highly conserved. C, helical wheel represen- 
tation of HR1 and HR2. Three HR1 helices 
and one HR2 helix are represented as helical 
wheel projections. The view is from the top of 
the structure. The three central HR1 helices 
form a central hydrophobic core with the 
interaction of residues in the a and d posi- 
tions. The three HR2 helices pack against 
these hydrophobic surface grooves through 
interactions with residues in the a and d 
positions in HR2 and e and g positions in 
HR1. These residues, mediating the interac- 
tions between HR1 and HR2, are always 
hydrophobic and conserved (see B). 


IBV 
HcoV-229E 
FIPV 


mainly through hydrophobic residues in three grooves on the 
surface of the central coiled coil trimer (see Fig. 44). Sequence 
comparison between MHV and other coronavirus spike pro- 
teins shows that residues contributing to the HR1/HR2 inter- 
action (e and g positions in HR1, a and d positions in HR2) are 
highly conserved (Fig. 1B). 

This pattern of sequence conservation can also be shown by 
a helical wheel representation of three HR1 helices and one 
HR2 helix (Fig. 1C). In this diagram, residues in the a and d 
positions of HR2 pack against residues in the e and g positions 
of HR1, mainly through hydrophobic interactions. Sequence 


CLVPRGSGGSGGSGGLEVLFQGP—DLSLDFEKLNVTLLDLTYEMNRIQDAI KKLNESY INLKE 
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comparison between MHV and HcoV-OC43 spike proteins 
shows that no nonconservative changes exist at the e and g 
positions of HR1, and only three such changes (L to I, L to F 
and I to L) occur in HR2 at the a and d positions. In contrast, 
7 of 26 nonconservative changes occur at the outside /, b, and c 
positions in HR1, and 5 of 12 nonconservative changes occur at 
positions other than a and d in the helical region (1232-1248) 
of HR2 (Fig. 1B). 

Comparison with Other Fusion Protein Structures—The 
structure of the MHV fusion core was compared with the 
known structures of other viral fusion proteins (Fig. 3A). Al- 
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MHYV 2-Helix 


nMHV 2-Helix 


Fic. 2. Overall views of the fusion core structure and super- 
position of nMHV (new construct for MHV fusion core) and 
MHV fusion core. A, top view of the MHV fusion core structure 
showing the 3-fold axis of the trimer. B, side view of the MHV fusion 
core structure showing the six-helix bundle. C, side view showing the 
superposition of nMHV fusion core (colored in blue) and MHV fusion 
core (colored in yellow). The columns at both sides of the map represent 
two HR1 and HR2 regions of nMHV and MHV fusion cores. The number 
at the end of these columns represents the end residues in the 
two structures. 


though there are significant differences in the sequences, sizes, 
and structural properties of these viral fusion proteins, the 
similarities in their overall structures suggest a common mech- 
anism for membrane fusion. These fusion core proteins are all 
trimeric coiled coils with the putative fusion peptides located at 
the N-terminal end of HR1 and the TM domains located at the 
C-terminal end of HR2 (7). The three HR2 polypeptides that 
form the outer layer of the central core vary in conformation 
among these structures, but they always form a-helices and 
always pack antiparallel to the interior coiled coil. In this 
pattern, which has been proposed to be a fusogenic conforma- 
tion, all of these fusion proteins would have their fusion pep- 
tides and TM anchors aligned at the same end of the coiled coil. 

There are three major differences between the MHV and 
other viral fusion core structures and their relevant regions, 
indicating unique features of fusion core structure in coronavi- 
rus S proteins. First, the conformation of HR2 in the fusion core 
structure is different from those of all other fusion core HR2 
regions (Fig. 3A). In the MHV fusion core structure, a major 
5-turn a-helix and a single-turn a-helix could be observed, and 
the remaining parts are extended segments. We can divide the 
HR2 polypeptide into five parts (Fig. 4A, right), parts 1-5. Part 
4 is a typical 3-4-3-4-3 pattern and forms a 5-turn a-helix of 
which the residues at positions a and d pack against residues 
at the e and g positions of the HR1. Part 2 is also an a-helix that 
exhibits the 3-4 spacing pattern, although it has only four 
residues, FEKL. Of these four residues, the residues in the a 
position (Phe!??+) and d position (Leu???) also pack against 
hydrophobic grooves formed on the surface of the HR1 core. 
Parts 1, 3, and 5 should also be a-helical based on prediction by 
the learn-coil VMF program (29). However, the structure shows 
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they are all extended to form a strand-like conformation. In 
these three parts, there is an interesting O-X-O motif where O 
represents hydrophobic residues, and X represents any residue 
but is generally hydrophilic. Part 1 contains the residues LSL, 
part 3 contains the residues VTL and LDL, and part 5 contains 
the residues INL. The two hydrophobic residues in these motifs 
pack against hydrophobic grooves on the surface of the HR1 
core, either facing the central core or aligning with the hydro- 
phobic groove. As a result, the O residues form hydrophobic 
interactions to stabilize the six-helix bundle, leaving the X 
residue directed into solvent. This pattern, which we think is a 
major reason why these three parts do not form a-helices, is 
also observed in the structure of SV5F (24), HRSV F (39), 
MMLV TM (40) and Ebola GP2 (41) (Fig. 4B). In these struc- 
tures, partial regions in HR2 or C-peptide are extended and 
strand-like rather than a-helical. In the 3-4-3-4-3 pattern in 
HR2 of these glycoproteins, the presence of an O-X-O motif 
would result in hydrophobic residues interacting with the hy- 
drophobic grooves on the surface of HR1 core, thus destroying 
the typical a-helix. In these three-dimensional structures, the 
distance between the two hydrophobic residues in the O-X-O 
motifs is about 5 A, which is also the distance between the two 
adjacent helices. Thus, the two hydrophobic residues could 
exactly pack against the grooves of the central coiled coil 
formed by three HR1 helices. This compatibility of HR2 seg- 
ments makes the fusion core more stable in solvent because 
most of the hydrophobic residues in HR2 are packed against 
the central core, leaving the hydrophilic residues exposed to 
solvent. This pattern also explains why not all residues in HR2 
of fusion cores from many other viral proteins form a-helices 
and why HR1 structures of these fusion cores are highly con- 
served, whereas HR2 regions always differ in their three-di- 
mensional conformations (Fig. 3A). 

Second, a proteinase K-resistant fusion core of MHV A59 
spike protein has been reported by Bosch et al. (28). After 
digestion by proteinase K, the fusion core comprising residues 
958-1040 in HR1 and 1216-1254 in HR2 remains intact. In 
this fusion core, the HR1 region is about 30 residues longer 
than the HR1 region of the fusion core we constructed. Al- 
though the fusion core structure we determined here is only 
part of the proteinase K-resistant fusion core, we propose that 
residues 958-967 and 1023-1040 in HR1 would also form 
coiled coils in the natural fusion core on the basis of their 
resistant capacity and other biochemical analyses (28). In the 
proteinase K-resistant fusion core of MHV spike protein, the 
central coiled coil HR1 region has about 80 amino acids, which 
is considerably longer than HR1 segments in other fusion cores 
such as HIV gp41 and SV5F1. This length is comparable with 
that of the fusion core in influenza HA, whose mechanisms for 
the membrane fusion have been studied in extensive detail 
(25). In addition, the HR1 region of the MHV spike protein is 
predicted to contain more than 100 amino acids by the learn- 
coil VMF program (29). This long helical coiled coil might be 
consistent with the long sequence of S protein, which is more 
than 1200 residues, to form the central skeleton of spikes on 
the surface of coronavirus. 

Third, although the fusion peptide is not part of the fusion 
core, it is also very important for investigating viral fusion 
mechanism. The putative fusion peptide of the MHV A59 spike 
protein is located at the N-terminal end of the HR1 region, and 
the cleavage site of the spike protein is about 250 amino acids 
away from the fusion peptide (28). The cleavage sites of other 
class I viral fusion proteins are all typically located adjacent to 
the fusion peptides (13). In the latter pattern, the likely role of 
the six-helix bundle structure is to facilitate juxtaposition of 
the viral and cellular membranes by bringing the fusion pep- 
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Fic. 3. Viral fusion proteins and models for membrane fusion. A, comparison of MHV fusion core with other viral fusion protein structures. 
The proteins under comparison include SV5F, Ebola GP2, HIV gp41, MMLV Env-TM, and low pH-induced influenza virus HA, tBHA2. Top and 
side views are shown for the six fusion core structures. B, model for coronavirus-mediated membrane fusion. The first state is the native 
conformation of coronavirus spike protein on the surface of viral membrane. It has been reported that the spike protein is trimeric in this 
conformation and about 200 A in length (6), but the exact structure of the full-length protein remains unknown. The second state is the prehairpin 
state of the S2 subunit. After several conformational changes, the fusion peptide inserts into the cellular membrane with the aid of other regions 
of S protein and possibly including the receptor. Although the internal fusion peptide is not exposed at the N-terminal of S2, it could insert into 
part of the target membrane by means of some hydrophobic residues. This insertion would be stable enough to drive the membrane motion with 
the conformational changes of HR1 region, which is adjacent to the fusion peptide. The third state is conformational change and juxtaposition of 
the target and viral membranes. With the help of other regions of S protein, the HR1 and HR2 regions move together and facilitate juxtaposition 
of the cellular and viral membrane. The last state is the postfusion conformation. The coiled coil will reorient with its long axis parallel to the 


membrane surface. The fused cellular and viral membranes make it possible for subsequent viral infections. 


tide, which inserts into the cellular membrane, close to the 
transmembrane segment, which is anchored in the viral mem- 
brane (7). In the case of the MHV spike protein, the question of 
why the HR1 region and fusion peptide are so far away from 
the cleavage site remains unknown. Nevertheless, some vi- 
ruses such as coronavirus (9) and Ebola virus (10) do not have 
cleavage sites in fusion proteins but still retain their fusion 
activity. This indicates that the location of the fusion peptides, 
whether exposed at the N terminus of the membrane-anchored 
polypeptide or not, is not an essential requirement for the viral 
fusion. We will give a possible mechanism in the further 
discussion. 

Evidence for the Conformational Change—Structural studies 
of the influenza virus HA and HIV gp41 have established a 
paradigm for understanding the mechanisms of viral and cel- 
lular membrane fusion (7). For coronaviruses, direct evidence 
for the conformational change in spike protein is lacking, al- 
though the crystal structure of the MHV fusion core bears 
similarity to these fusion-active state molecules. The structure 
of the MHV 2-Helix could correspond to the fusion core of MHV 
spike protein in either the fusogenic or the native form of the 
envelope glycoprotein, or both. Several considerations provide 
good evidence that the fusion core in the crystal structure 
presented here is the final, stable form of the protein, which is 
a fusion-active state following one or more conformational 
changes. 

First, the fusion core of MHV spike protein is exceedingly 


stable to both thermal denaturation and proteinase K digestion 
(28). The six-helix bundle has a melting temperature of about 
85 °C and could not be separated in general SDS-PAGE unless 
boiled at 100 °C with a loading buffer containing a high con- 
centration of SDS (28). These properties indicate that the com- 
plex is very stable and could not be dissociated by any biolog- 
ically relevant interaction. This form of fusion core must be 
present at the later stage of conformational changes for viral 
fusion, although it is not known whether the complex main- 
tains this conformation throughout the entire process. 

Second, virus-cell entry inhibition and cell-cell fusion inhibi- 
tion experiments (28) also provide strong evidence that the 
fusion core is formed after one or more conformational changes. 
HR2 of the MHV 2-Helix could block viral entry and cell-cell 
fusion in a concentration-dependent manner and appears to be 
a potent inhibitor (28). In HIV gp41, the C-peptide and its 
derivatives have been shown to act as dominant-negative in- 
hibitors by binding to the endogenous N-peptide coiled coil 
trimer within viral gp41 (19, 42, 43). A reasonable interpreta- 
tion of the data for the MHV fusion core is that HR2 functions 
in a dominant-negative manner similar to the C-peptide in HIV 
gp41 by binding to the transiently exposed coiled coil in the 
prehairpin intermediate and thus preventing the conforma- 
tional changes required for viral fusion. 

Third, mutations in the MHV spike protein that abolish 
viral-cell fusion activity often map to the HR1 or HR2 residues, 
which are expected to stabilize the fusion core structure re- 
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Fic. 4. O-X-O motifs in HR2 regions of MHV and the comparison with those of other fusion proteins. A. Left and center, surface map 
showing the hydrophobic grooves on the surface of three central HR1 helices. Three HR2 helices pack against the hydrophobic groove in an 
antiparallel manner. The helical regions in HR2 extended regions could be observed clearly. The helical region of HR2 just packs against the deep 
groove, and the extended region packs against the shallow groove. Right, detailed structure of O-X-O motifs in MHV HR2 region. One HR2 helix 
is divided into five parts based on its secondary structure. The helical regions (parts 2 and 4) HR2 are colored in red, and extended regions (parts 
1, 3, and 5) are colored in blue. The essential residues of the three extended regions and O-X-O motifs in these regions are shown; residues colored 
in green represent the hydrophobic residues in O-X-O motifs. The three panels on the left show the enlarged images of parts 1, 3, and 5. The 
hydrophobic residues in these motifs are all packed against the hydrophobic grooves on the surface of three HR1 helices. B, detailed structures of 
O-X-O motifs in other fusion proteins including SV5F, HRSV F, MMLV Env-TM, and Ebola GP2. They all contain similar motifs in HR2 regions. 
The regions in which O-X-O motifs are located form extended regions but not a-helices, in a way similar to the MHV 2-Helix. 


ported here. These studies show that mutations in some essen- 
tial positions in HR regions abolish infectivity and membrane 
fusion (44, 45). The L981K and F977K mutations are particu- 
larly noteworthy because cells expressing mutant spike pro- 
teins with one of these mutations are almost completely defec- 
tive in membrane fusion, although the surface expression level 
of spike proteins remains the same as for the wild type (44). 
Residue Leu®*’ is in the a position and Phe®”’ in the d position 
of the HR1 peptide, and thus both are essential for the forma- 
tion of the central hydrophobic coiled coil (Fig. 1C). The L981K 
and F977K mutations in HR1 region substitute hydrophobic 
residues with hydrophilic residues, destroying the hydrophobic 
interaction and the formation of the six-helix bundles and 
subsequently abolishing the membrane fusion. In contrast, 


replacement of the same residues with other hydrophobic 
amino acids (F977L, L981I) does not reduce fusion activity 
(44). L1224A/L1231A, L1224A/11238A, L1224A/L.1245A, and 
11231A/L1245A mutations in HR2 also abolish fusion activity 
greatly (45). In these double substitution mutants, the Leu and 
Tle residues are both in d positions in HR2 and are also very 
important for the formation of the fusion core (Fig. 1C). These 
double substitution mutants could not maintain a stable coiled 
coil structure even though Leu or Ile was changed for Ala, a 
hydrophobic residue. The locations of these particular muta- 
tions indicate that the interactions between HR1 and HR2 are 
critical for membrane fusion. 

Lastly, the structural similarities between the MHV 2-Helix 
complex, the fusion core of low pH-induced conformation of 
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influenza HA2 (14), and the structure of HIV gp41 (19), each of 
which has been proposed to have fusion-active conformations, 
indicate that the MHV 2-Helix structure studied here repre- 
sents the core of the fusogenic conformation of spike protein 
after conformational change (Fig. 3A). In all three structures, 
the putative hydrophobic fusion peptides are located close to 
the N-terminal end of the HR1 region, which forms a central 
coiled coil. Three strands of HR2 which pack against the coiled 
coil trimer in an antiparallel manner stabilize this hydrophobic 
coiled coil. These common features suggest that the MHV spike 
protein also possesses a conformational change mechanism 
similar to influenza HA and HIV gp41 (7). 

Implications for Models of Membrane Fusion Mechanisms— 
Although we have no structures of full-length MHV spike pro- 
tein either in the prefusion or postfusion state, we can propose 
a model for MHV membrane fusion mechanism based on the 
fusion core structure studied here and previous analysis of 
spike proteins. Current models for the class I viral fusion 
mechanisms suggest that the exposed fusion peptide located at 
the N terminus of the membrane-anchored subunit may be 
important for the juxtaposition of two membranes prior to 
fusion (7). For coronaviruses, the S2 fragment contains a pu- 
tative internal fusion peptide that is not exposed at the N 
terminus (44). This pattern of internal fusion peptide in the 
MHV spike protein is reminiscent of fusion loops in class II 
viral fusion proteins, which are internally located in the fusion 
protein (46, 47). Recent structural studies of the dengue virus 
envelope protein show that the highly conserved internal fu- 
sion loop penetrates only ~6 A into the hydrocarbon layer of 
the cellular membrane (46). In this structure, an aromatic 
anchor formed by Trp’®! and Phe!’ inserts into the cellular 
membrane. Studies of a 20-residue influenza virus A fusion 
peptide with a detergent micelle suggests a kinked a-helix, 
with the N and C termini embedded in the outer leaflet and the 
kink on the surface (48). 

Both the fusion loop from dengue virus E protein and the 
fusion peptide from influenza virus indicate that the anchoring 
into the lipid bilayer may not require complete insertion of the 
fusion peptide into the membrane. The exposure of the fusion 
peptide at the N terminus of the fusion protein is also not 
essential for the viral fusion. In this case, which also applies to 
several coronavirus spike proteins, the fusion peptides may 
insert into the membrane via several hydrophobic resides or a 
kinked hydrophobic loop (Fig. 3B). In our model shown in Fig. 
3B, the internal fusion peptide of spike proteins forms a small 
hydrophobic core, which would insert into the membrane after 
several conformational changes induced by receptors or pH 
changes. HR2 and HR1 would then form a coiled coil to facili- 
tate juxtaposition of the cellular and viral membranes, followed 
by virus-cell membrane fusion. 

In coronavirus spike proteins, S1 is mainly responsible for 
binding to receptor and S2 for fusion (5). The flexibility be- 
tween HR1 and HR2 is not sufficient enough to allow for 
complete conformational changes to occur, although the two 
heptad repeat regions have a great tendency to pack closely 
together. Instead, other intervening regions of the spike pro- 
tein should promote the formation of the fusion core. Further 
experiments should reveal how the exact conformational 
change occurs and how best to inhibit the viral fusion. 

Potential Inhibitors of Coronaviruses Entry into Cells—The 
sequence alignment of HR1 and HR2 regions in spike proteins 
among coronaviruses shows significant similarity in their hep- 
tad repeat regions (Fig. 1B). For example, between MHV and 
SARS, identity of HR1 is 60% and positive is 91%, identity of 
HR2 is 35 and positive is 85%. Important residues for six-helix 
bundle formation, located in the a, d, e, and g positions of HR1 
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and a and d positions of HR2, are all highly conserved. We can 
conclude that coronavirus spike proteins share a similar bind- 
ing region of the HR1-HR2 complex and three-dimensional 
fusion core structure. Analogous to the HIV C-peptides, the 
HR2 region of coronavirus spike proteins most likely functions 
in a dominant-negative manner by binding to the transiently 
exposed hydrophobic grooves in the prehairpin intermediate 
and consequently blocking the formation of the fusion-active 
hairpin structures (43). These strategies have been used suc- 
cessfully in fusion inhibitors design for HIV (7, 42, 49-51). A 
similar approach may be applied to identify inhibitors of coro- 
navirus infection. HR1 and HR2 regions and their derivatives 
are all potential inhibitors. Cavities and grooves on the surface 
of the central coiled coil are strong potential binding sites for 
small molecule inhibitors. 

In conclusion, the crystal structure of MHV fusion core 
shows the first fusion core structure of any coronavirus. Al- 
though the structure shares common features with those of 
other viral fusion proteins, it has unique characteristics that 
distinguish it from other fusion core structures. Sequence 
alignment of HR regions among coronaviruses indicates a sim- 
ilar structure among coronavirus spike proteins and suggests a 
common mechanism for viral fusion. This structure will also 
open an avenue toward the structure-based fusion inhibitor 
design of peptides, or peptide analogs, e.g. small molecules, 
targeted against emerging infectious diseases, such as SARS. 
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